Picture for Qiang Liu

Qiang Liu

Linda

Learning When Not to Act: Mitigating Tool Abuse in Agentic Reinforcement Learning

Add code
Jun 01, 2026
Viaarxiv icon

GAPD: Gold-Action Policy Distillation for Agentic Reinforcement Learning in Knowledge Base Question Answering

Add code
May 28, 2026
Viaarxiv icon

EarlyTom: Early Token Compression Completes Fast Video Understanding

Add code
May 28, 2026
Viaarxiv icon

Training-Free Looped Transformers

Add code
May 22, 2026
Viaarxiv icon

CRAFT: Conflict-Resolved Aggregation for Federated Training

Add code
May 20, 2026
Viaarxiv icon

XDecomposer: Learning Prior-Free Set Decomposition for Multiphase X-ray Diffraction

Add code
May 07, 2026
Viaarxiv icon

Policy Gradient Primal-Dual Method for Safe Reinforcement Learning from Human Feedback

Add code
Apr 21, 2026
Viaarxiv icon

MDPBench: A Benchmark for Multilingual Document Parsing in Real-World Scenarios

Add code
Mar 30, 2026
Viaarxiv icon

RealChart2Code: Advancing Chart-to-Code Generation with Real Data and Multi-Task Evaluation

Add code
Mar 26, 2026
Viaarxiv icon

Gumbel Distillation for Parallel Text Generation

Add code
Mar 23, 2026
Viaarxiv icon